Abstract
This chapter describes progress in building computer systems that understand people and can work with them in the manner of an attentive, human-like assistant. To accomplish this, I have built a series of real-time experimental testbeds, called Smart Rooms. These testbeds are instrumented with cameras and microphones and perform audio-visual interpretation of their human users. Real-time capabilities include 3D tracking of the head, hands, and feet, and recognition of hand and body gestures. The system also supports face recognition and interpretation of facial expression.
Introduction
My goal is to make it possible for computers to function like attentive, human-like assistants. I believe that the most important step toward achieving this goal is to give computers an ability that I call perceptual intelligence: they must be able to characterize their current situation by answering the questions who, what, when, where, and why, just as writers are taught to do.
In the language of cognitive science, perceptual intelligence is the ability to solve the frame problem: to classify the current situation so that the system knows which variables are important and can act appropriately. Once a computer has the perceptual intelligence to know who, what, when, where, and why, simple statistical learning methods have been shown to be sufficient for it to determine which aspects of the situation are significant and to choose a helpful course of action [205].
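To make the claim concrete, the following is a minimal sketch (not the system described in this chapter) of how a perceptually classified situation can drive action selection with nothing more than frequency counting. The situation tuples, feature names, and action labels are all hypothetical illustrations.

```python
from collections import Counter

# Hypothetical logged observations: (situation, action that proved helpful).
# A situation is a (who, where, what) tuple; all names are illustrative.
LOG = [
    (("alice", "desk", "typing"), "mute_notifications"),
    (("alice", "desk", "typing"), "mute_notifications"),
    (("alice", "door", "leaving"), "save_work"),
    (("bob", "desk", "reading"), "dim_lights"),
    (("bob", "desk", "typing"), "mute_notifications"),
]

def choose_action(situation, log):
    """Pick the action most often helpful in similar past situations.

    Each past situation votes for its logged action, weighted by how many
    features it shares with the current one -- a simple statistical
    (frequency-based) decision rule, with no hand-coded action logic.
    """
    votes = Counter()
    for past, action in log:
        overlap = sum(a == b for a, b in zip(situation, past))
        votes[action] += overlap
    return votes.most_common(1)[0][0]

# A new person typing at the desk matches the "typing at desk" pattern:
print(choose_action(("carol", "desk", "typing"), LOG))  # -> mute_notifications
```

The point of the sketch is that once perception has reduced the scene to who/where/what labels, the learning component can be almost trivial; all of the difficulty lies in producing those labels reliably.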